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ABSTRACT 

We discuss the stationary states of a model economy in which N heterogeneous adaptive consumers purchase 
commodity bundles repeatedly from P sellers. The system undergoes a transition from an inefficient to an 
efficient state as the number of consumers increases. In the latter phase, however, price fluctuations may be 
much larger than in the inefficient regime. Results from dynamical mean-field theory obtained for N — ► oo 
compare fairly well with computer simulations. 
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1. INTRODUCTION 

Systems of heterogeneous units interacting competitively - like producers and consumers in modern economies, 
traders in financial markets, drivers on city roads and highways or users accessing a computer network - pose 
serious coordination problems of both theoretical and practical relevance. It is reasonable to think that each 
agent in such situations aims at optimizing his or her individual performance making decisions on the basis of 
available information and experience. However, private goals will most certainly be conflicting, which makes the 
reach of a globally efficient phase where the collective use of resources is optimal far from certain. More generally, 
the interplay between different macroscopic properties in these systems may turn out to be quite subtle. It is clear 
that one of the crucial points that a theory of these systems should address is what resource load distributions 
may emerge and under which conditions. Unfortunately, the mathematical framework of general equilibrium 
theory [1] does not appear to be able to provide answers to these problems. Its main stumbling block consists 
in the fact that, from a technical viewpoint, it is rather difficult to extract robust macroeconomic laws from the 
underlying microeconomic assumptions while maintaining the crucial ingredient of heterogeneity [2] (see also [3,4] 
for alternative approaches that try to overcome these problems). 

On the other hand, agent-based models in which agents' behavior is described by simplified stochastic laws 
are often more amenable to tackle these issues. A remarkable example is given by the Minority Game [5,6], 
whose elementary framework allows to elucidate many important aspects by addressing directly the relation 
between microscopic behavior and macroscopic properties (like fluctuations, predictability and efficiency). From 
a physical viewpoint, these models are relatives of mean- field spin glasses [7] and neural networks [8]. They can 
thus be studied in detail using the statistical mechanics toolbox for disordered systems, by which one can derive 
equations for the relevant observables while fully preserving heterogeneity at the level of agents. 

Here we introduce and study both analytically and numerically a model closely related to both the batch 
Minority Game [9] and to the traffic model introduced in [10] in which the subtle interdependence of macroscopic 
properties is particularly striking. We aim at describing the adaptive dynamics of N consumers who on each day 
select their consumption among P possible commodities or producers using a minimum expected cost criterion 
and learning from experience. The system turns out to reach a steady state whose macroscopic properties, such as 
the distribution of consumer choices over commodities, are to a large extent independent of the microscopic details 
for N — ► oo. Upon increasing the ratio N/P one observes a transition from a regime where some commodities 
are over- or under-used (and therefore some sellers are more convenient than others) and the average price is 
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high to one in which consumers are uniformly distributed over resources and the average price is low. However 
while the former phase is characterized by contained price fluctuations, in the latter fluctuations may be much 
larger. This type of scenario is not dissimilar to that found in Minority Games (although details are somewhat 
different). The present model however provides hopefully a more concrete ground for testing it against empirical 
data. 

2. MODEL DEFINITIONS 

We consider a system with TV consumers and P producers or sellers. For our purposes, one may either think that 
different producers sell different goods, or that each seller supplies a different variety of the same broad category 
of commodities (for instance, perishable goods). On each day t = 1,2,..., every consumer i has to acquire one 
of S possible bundles of commodities, for instance for his or her subsistence. A bundle is a vector q ig = {<z^} 
such that q^ g denotes the amount of goods i demands from seller fi (fi £ {1, . . . , P}). g G {1, . . . , S} labels the 
different bundles of each consumer. Consumers are heterogeneous, in the sense that different consumers have 
different needs and thus different possible bundles. We assume that sellers set the daily price of commodities 
according to the demand they receive, denoted by D^{t), so that the higher the demand the higher the price. 
Hence prices do not enter directly in the model, but just through the demands. Each consumer on the other 
hand aims at purchasing, on each day, the bundle he or she finds more convenient, labeled by gi(t), with the 
limitation that when the choice is made the price at which the purchase will take place is not known yet (it is 
determined by the collective decision of all consumers, which form the demands). Hence they try to learn the 
convenience of different bundles from experience in order to be able to predict which bundle will have the highest 
marginal utility on any given day. The events taking place on each day t can be summarized by the following 
scheme: 

9i(t) = argmaxf/ i9 (i) (1) 

U ig (t + 1) - U ig (t) = ~E < W fc l ( 3 ) 

At the decision stage, Eq. (1), each consumer chooses the bundle which carries the highest (cumulated) utility 
Ui g (t). The different choices are then aggregated, Eq. (2), and the (normalized) demands are formed. Finally, 
(3), utilities are updated with the following rationale: if the demand of a commodity \i is above a certain 
threshold k, consumers perceive that commodity as too costly and the utility of bundles including it will tend to 
be reduced; similarly, if the demand has been lower than k the commodity will be seen as 'cheap' and will tend 
to increase the utility of the bundle. (This mechanism is conceptually identical to that employed in standard 
Minority Games.) The utility of the bundle is then determined by the demands of all commodities in the bundle 
though a simple average. (This is what makes the model more similar to a batch Minority Game.) 

In principle, k could be agent- and commodity-dependent. For the sake of conceptual and technical simplicity, 
we ignore this possibility here. We assume that bundles q ig are quenched random vectors with probability 
distribution 

p (q ig ) = II K 1 - iW 1 O + ( 4 ) 

(0 < q < 1 being the probability that any given commodity is part of a bundle) that are assigned to consumers 
independently on i and g on day n = and are kept fixed. In this way, we introduce a further simplification in 
that each seller is either visited or not by a consumer, and the purchased quantities play no role. Moreover, we 
are implicitly assuming that the different goods are equivalent to consumers, that is there is no commodity that 
all consumers will need to buy. Finally, we assume that the learning dynamics (3) is initialized at values Ui g (0) 
about which more will be said later on. 



We concentrate our attention on the macroscopic properties of the steady state(s). The observable by which 
they will be characterized is given by the magnitude of demand fluctuations, 

a = pE[((^) 2 )-^) 2 ] ( 5 ) 

(here and in what follows, (. . .) stands for a time average in the stationary state of (3)). Because of our 
assumptions on the relation between prices and demands, A quantifies the typical spread of prices in the economy. 
A measure of how evenly consumers are distributed over producers is instead given by 

H =^Y,( D "-~D) 2 (6) 

where D = 1 — q is the expected demand. If H = 0, each seller receives on average the same demand so that 
none of them is perceived as more convenient. In this case, consumers are distributed uniformly over producers. 
If H > 0, instead, the distribution of demands is not uniform and some producers are seen as more or less 
convenient than others. When H > an external agent who watches the economy from the outside trying to 
identify the best bargain would manage to find more convenient sellers and make a profit. When H = 0, instead, 
this would not be possible. So one sees that transitions from regimes with H > to regimes with H — can 
be seen as transitions between inefficient and efficient states of the economy, where by efficient state we mean 
one where goods flow from producers to consumers in such a way that no information exploitable by an external 
agent is generated. States that are optimal from a collective perspective have both H = and A small, because 
on one hand efficiency is desirable and on the other price fluctuations should be such that agents have as much 
cost certainty as possible on a day by day basis. Hence H and A describe intertwined properties, and it is on 
their mutual dependence that we shall focus in what follows. 

3. DYNAMICAL SOLUTION FOR S = 2 (SKETCH) 

The above model can be solved exactly for 5 = 2 (two possible bundles per consumer) resorting to dynam- 
ical techniques developed in the context of mean-field spin glass theories, which allow to obtain a complete 
macroscopic characterization of the stationary states in the limit N — > oo. In this limit, the most remarkable 
phenomenology is obtained when the number of sellers scales linearly with N, i.e. when lini/v— >oo N/P = n is 
finite. This is the case we will consider henceforth. In this section we will limit ourselves to a sketch of the broad 
lines of this approach, which requires a calculation that can be carried out (modulo some necessary modifications) 
following the lines traced in [9] and subsequent papers for the batch Minority Game (see [11] for the most recent 
work and for references). The resulting theory correctly describes the steady states for the particular choice 
k = 1 — q, that is for the case in which the threshold equals the expected price of commodities. This simplifies 
the calculation considerably Different choices of k (which may lead to significantly different physics) will not be 
considered here. 

The solution relies on the introduction of the auxiliary variables 

€< = *^ «, = *^ and Pi {t)= Uil{t) - Ui2{t) (7) 

in terms of which the bundle selected by consumer i on day t can be written as Qi gi m — + s i(t)£i with 
Si(t) = sign[pj(t)]. The latter - Ising spins - are ultimately the relevant microscopic dynamical variables of the 
problem. We call pt the 'preference' of agent i, since if Pi(t) > (resp. p%{t) < 0) he/she selects bundle 1 (resp. 
2). The time evolution of the preferences is governed by the equation 



Pi(t + i)-j*(t) = ~£tf 



p 



+ hi(t) ( 8 ) 



where we added a small external probing field hi(t) for later use. One sees that our system is described by a set 
of N coupled Markovian 'zero temperature' processes with quenched disorder. In this case, like often in models 



with mean-field interaction [12], the steady states can be completely described in the limit N — > oo in terms of 
two-time macroscopic correlation and response functions: 



c(t, f) = i y: ((Sim G( t , f) = i y: o) 

where the double brackets represent an average over realizations of (8) at fixed disorder {u)i, and the over- line 
stands for an average over the disorder. The canonical method to obtain equations for these quantities in these 
problems consists in evaluating the dynamical partition function 

Z[ip] - ^e iE M"(*W«W^ = /" e iS ^' 5i( * )v,iW p(p(0)) i S (equation {8))d Pi (t)] (10) 

i,t 

(with p(p(0)) the distribution of initial conditions) which satisfies 

^-.fe5?SM ^'-.fe^SR^F) <"> 

This can be done in two steps. First, the disorder average is carried out, generating two-time dynamical order 
parameters such as Q(t,t') — (l/N)J2i s i(t) s i{t')- Next, the limit N — > oo is performed via a saddle-point 
integration. At the relevant saddle, the dynamical order parameters can be identified with the correct correlation 
and response functions. This in turn leads to a process describing the behavior of a single 'effective' consumer 
via a stochastic non-Markovian equation whose generic form is 

P (t + 1) - P (t) = h(t) + ]T R(t, t')s(t') + V (t) (12) 

v 

where R(t, t') is known as the retarded self-interaction kernel and r](t) is a zero-average Gaussian noise with a 
non-trivial covariance matrix (t](t)r](t')). Different models ultimately result in different forms of R(t,t') and of 
(i](t)rj(t ')) . Both these quantities turn out to depend only on the correlation function C = {C(t, t')} and on the 
response function G = {G(t,t')}, which can in turn be obtained from the statistics of the effective consumer's 
spin s(t) = sign[p(£)] generated by the dynamical mean-field equation (12): 

C(t,t') = (s(t)s(t')) G (t,t') = ^±- (13) 



For the model described here, the retarded self-interaction kernel reads 

R= _Q0^Q)_ [1 + q{1 _ q)G] -i (14) 



where 1 = {6tt>}, while the noise is conveniently written as r)(t) = y/q(l — Q)/ n z (t) with covariance matrix 
(z{t)z(t')) = A(t,t') and 

A = [1 + q(l - q)G]- 1 [q(l - q)(E + C)] [1 + q(l - q)G}- 1 (15) 

where E = {1} (for all t,t'). 

In principle, once (12) is obtained, it is possible to calculate C(t,t') and G(t,t') at all times (notice that time 
has been kept finite up to now). In what follows, we shall concentrate on extracting the macroscopic behavior 
in the limit t — > oo. This will be later compared to numerical simulations. 



4. STATIONARY STATE OF THE DYNAMICAL MEAN-FIELD EQUATIONS: 
PERSISTENT ORDER PARAMETERS AND FLUCTUATIONS 



In order to calculate the steady state properties, it is necessary to formulate suitable Ansatze for the asymptotic 
behavior of correlation and response functions. We assume here that the system reaches an ergodic steady state, 
in which both are time-translation invariant and there is no anomalous response. This corresponds to requiring 
that [12] 



lim C(t + r,t) = C(t) lim G(t + r,t) = G(r) 



t— i-OO 



X := lim V G(t) < oo 

T— >00 ^— ' 

t<T 

limG(M') = W finite 



(16) 
(17) 

(18) 



(top to bottom: time translation- invariance, absence of anomalous response, absence of long-term memory). 
Numerically, one observes that in the steady state some agents always buy the same bundle while others keep 
flipping between their possible bundles. Borrowing Minority Game jargon, we call the former, for which \pi(t)\ 
grows linearly with time, 'frozen' and the latter, for which preferences remain finite as t — > oo, 'fickle'. Their 
respective contributions to the steady state properties can be studied separately. Defining pit) — p(t)/t one has, 
from (12), 

(19) 



f<t t"<t< 



where R is given by (14) and the noise covariance by (15). Taking the limit t — > oo and neglecting the external 
field this becomes 

q(l-q)s , . Iq(l-q) 



P = -- 



+ 



provided one defines 



n[l + q{l-q) X ] 
p= lim p(t) s= lim - Vs(t') z= lim - \^ z(t') 



i' t' 

(x is the integrated response defined in (17)). The variance of z is obtained from (15): 



<z 2 )= lim ly y A( t ',t")= ^-g)( 1 + c ) 



(20) 
(21) 

(22) 



t'<t t"<T 



where c is the persistent autocorrelation c — limt^oo(l/t) J2t' ^(*')- Now for a frozen consumer p is non-zero 
and s is either 1 (for p > 0) or — 1 (for p < 0). This leads to the condition \z\ > 7 with 



v / g( 1 - q)/ n 

l + q{l-q)X 



(23) 



Similarly, for a fickle consumer p = and s = z/7, which corresponds to \z\ < 7. One may now easily derive a 
self-consistent equation for c = (s 2 ): 



c = (0(\z\ 7 )> + (^ 2 ^(7 - M)/7 2 ) = + ^ 



(24) 



where = (0(|z| — 7)) = 1 — erf(A/\/2) is the fraction of frozen consumers, <j) — 1 — (j) is the fraction of fickle 
consumers, and A = 7/ ^/ (z 2 ) = 1/ y/n{l + c). In order to obtain an equation for x, we can proceed as follows: 



X 



<9.s 



1 



ds 



y/q(l-q)/n \ dz / 7y/q(l-q)/ 



(25) 



where we used the fact that frozen agents are insensible to small perturbations and thus do not contribute to 
the response. Eq.s (24) and (25) can be easily solved numerically to obtain c and x as a function of n for 
different q. Notice that when \ diverges (17) is no longer valid and the current theory breaks down. Using the 
definition of 7, this turns out to happen when n is such that n§ = 1. This condition is independent of q. Minor 
manipulations performed imposing it to (24) lead to the critical value n c = 2.9638 . . ., which represents the point 
above which the ergodic solution obtained starting from (16), (17) and (18) must be replaced by a non-ergodic 
one, in particular by one with long-term memory for which the steady state depends on the initial conditions of 
the dynamics. We will not consider this region analytically here. However, we will show by simulations that the 
above picture is correct. 

As we stated before, most of our attention is placed, rather than on c and \-> on A and H. A is particularly 
hard to calculate exactly, since it is not in general a function of the persistent order parameters only (both 
long-term and short-term fluctuations contribute to it). However, it is possible to obtain rough estimates for 
both quantities in terms of c and \ alone from the matrix S = |A, using the approximate method first employed 
in [9] to calculate the volatility of the batch Minority Game. This procedure consists essentially in separating the 
persistent contribution to H from the non-persistent one and in neglecting the autocorrelation of fickle consumers. 
We shall omit details of the (long but straightforward) derivation here. Let it suffice to say that H and A have 
the following approximations: 

H = g(1 - g)(1 + G) 2 (26) 
2[l + q(l-q)x] 2 



g(l-g)(^-c) l f 
2[l + q(l-q) X } 2 2 



Notice that at n c , i.e. when \ diverges, H vanishes. This means that H behaves roughly as a physical order 
parameter: for n > n c the system reaches an "ordered" state in which the distribution of consumers over 
producers is uniform. The behavior in the limit n — > can be obtained with a minimal algebraic effort. It turns 
out that in this limit c <~ n so vanishes. The same can be said for x, so that finally H — > q(l — q)/2 and 
A -> q(l - q)/2. 



5. COMPARISON WITH NUMERICAL RESULTS 

We have performed computer simulations of the model with k = 1 — q for different q and initial conditions, fixing 
the product NP — 16000. We shall first discuss the case 5 = 2, for which the theory outlined above holds. Later 
the dependence on S will be addressed. We only consider the case q < 1/2 since analytically all macroscopic 
properties turn out to be invariant under transformations q — > 1 — q. We have verified numerically that this is 
indeed so (not shown). 

Fig 1 shows the behavior of H and c as a function of n for various q. As expected, H is similar to a physical 
order parameter. Its behavior indicates that as the number of consumers increases they tend to distribute more 
and more uniformly over producers until, for n — n c the distribution becomes uniform. For n < n c the economy 
is inefficient as the uneven distribution of demands generates exploitable profit opportunities. For n > n c the 
economy is instead efficient. Notice that results arc indeed independent of initial conditions in the inefficient 
phase, while when n < n c the theory developed for the ergodic regime ceases to describe the steady state 
correctly. As a matter of fact, the steady state for "flat" initial conditions Pi(0) = for all i and "biased" initial 
conditions p,(0) =0.1 for all i lead to very different regimes from a macroscopic viewpoint, as can be easily 
inferred from the behavior of c. 

This is even more evident when one turns the attention toward fluctuations (see Fig. 2). We show in particular 
the behavior of the quantity 

S = ^E((^-^) 2 ) = A + ^ ( 28 ) 

which also serves as a measure of the total amount of money invested by consumers. We see that in the inefficient 
phase fluctuations are small and well described by the approximate equations derived in the previous section. 




Figure 1. Behavior of H and c versus n for different values of q and S — 2. Markers correspond to results from computer 
simulations, averaged over 100 disorder samples. The dashed vertical line marks the position of the critical point n c . The 
solid lines give the analytic result (valid only for n < n c ). "Flat" and "Biased" refer instead to different initial conditions: 
Pi(0) = and Pi(0) =0.1, respectively. For H only results for q = 1/2 are shown for different initial conditions, for 
simplicity. 



When the economy becomes efficient, however, the dependence on initial conditions may drive the system to 
both states with large price fluctuations where (S ~ n), which are rather undesirable, and states with small 
fluctuations (where £ ~ l/n>). This can be interpreted with the following mechanism (borrowed from [10]). When 
there are few consumers, many sellers receive small demands and thus the economy presents many profitable 
opportunities. As more and more consumers join the opportunity window shrinks and players may be forced 
to switch bundles repeatedly in the attempt to identify convenient commodities. This leads to the increase of 
fluctuations and ultimately to a loss of day-by-day cost certainty. 

Coming finally to the case of varying S (for which we fix q at 1/2, see Fig. 3), one sees that as the number of 
possible bundles per consumer grows, the phase transition is preserved although the critical point shifts to smaller 
values, indicating that a smaller number of consumer can fill the product space efficiently when consumers have 
more possible choices. However the peculiar behavior of fluctuations in the efficient regime is left unchanged. 

6. CONCLUSIONS 

The model presented here aims at studying how the choices made by adaptive consumers, and in particular how 
they distribute themselves over sellers, affect the resulting price distribution. We have seen that the interplay 
between the two aspects of the problem is rather subtle and definitely worth of further study. Many conclusions 
valid for Minority Games hold also for this model. In particular, it is possible to argue that if consumers may enter 
the market when there are many convenient opportunities and leave it when there is none, the economy would 
self-organize at the critical point n c , which is the most efficient from a collective viewpoint. It is also interesting to 
notice that the regime with H > can be seen as one in which some consumers have identified a convenient seller 
from which they buy preferentially. Then the fact that H > can be interpreted as the formation of some sort of 
stable trading relationship (interestingly accompanied by a higher average price though smaller fluctuations). It 
may be possible to calculate H and check whether the picture described here is realistic directly from empirical 
trading data from some specific markets in which agents trade frequently. Examples that have received some 
attention in the economic literature are markets for perishable goods such as fish. Work along these lines is in 
progress. Finally, the model acquires yet more richness when consumers are given a degree of stochasticity in 
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Figure 2. Behavior of E (see text) versus n for different values of q and S = 2. Markers correspond to results from 
computer simulations, averaged over 100 disorder samples. The dashed vertical line marks the position of the critical 
point n c . The solid lines give the analytic result (valid only for n < n c ). "Flat" and "Biased" refer instead to different 
initial conditions: pi(0) = and Pi(0) = 0.1, respectively. 
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Figure 3. Behavior of _ff and E versus n for different values of S and q = 1/2. Markers correspond to results from 
computer simulations, averaged over 100 disorder samples. Lines are just a guide for the eye. "Flat" and "Biased" refer 
instead to different initial conditions: pi(0) = and pi(Q) = 0.1, respectively 



their decision making process, for example in the form of a finite learning rate, or when the dynamics of demand 
(which is the only side addressed here) is coupled to a supply dynamics generated by producers. In these cases, 
the behavior may depart significantly from that of the standard Minority Game (to some extent, this aspect was 
discussed in [10]). The extension of the present theory to that case will be reported elsewhere. 
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